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When nodes can repeatedly update their behavior (as in agent-based models from computational social science 
or repeated-game play settings) the problem of optimal network seeding becomes very complex. For a popular 
spreading-phenomena model of binary-behavior updating based on thresholds of adoption among neighbors, we con¬ 
sider several planning problems in the design of Sticky Interventions: when adoption decisions are reversible, the 
planner aims to find a Seed Set where temporary intervention leads to long-term behavior change. We prove that 
completely converting a network at minimum cost is fl(ln(OPr))-hard to approximate and that maximizing con¬ 
version subject to a budget is (1 — i)-hard to approximate. Optimization heuristics which rely on many objective 
function evaluations may still be practical, particularly in relatively-sparse networks: we prove that the long-term 
impact of a Seed Set can be evaluated in 0(|f?|^) operations. For a more descriptive model variant in which some 
neighbors may be more influential than others, we show that under integer edge weights from {0,1, 2,..., A:} objective 
function evaluation requires only 0{k\E'\^) operations. These operation bounds are based on improvements we give 
for bounds on time-steps-to-convergence under discrete-time reversible-threshold updates in networks. 

Keywords: Network seeding, Spreading phenomenon. Combinatorial optimization. Convergence 


1 Introduction 


In the social and behavioral sciences there is a growing interest in the descriptive power of agent-based 
models. The term agent-based describes a broad class of models in which many independent agents 
repeatedly update their behavior in response to local interactions with other agents. After some time, 
emergent properties of the global system may be observed. Popular agent-based modeling tutorials by 
well-known social sci entist s and simulation resear c hers h ave been cited hundreds of times (such as Ax¬ 
elrod and Tesfatsion (2006) and |Vtacal and North ( 2010 )). Many features in agent-based models build 
on ideas that mathematicians and computer scientists have studied classically as cellular automata or 
somewhat more recently as spreading-phenomena models. 

In fields where controlled experiments are very difficult, agent-based models may offer a useful (if 
approximate) synthetic experiment to shed light on the effects of proposed interventions. Given an agent- 
based model, a natural question follows: what does the model suggest about the form of the best interven¬ 
tions? What types of interventions will yield long-term improvements? Depending on the specification of 
agent behavior, and on how heterogeneously-structured the patterns of agent interaction are, this planning- 
oriented question can raise serious computational and algorithmic challenges. Most positive theoretical 


ISSN 1365-8050 © 2016 by the author(s) 


Distributed under a Creative Commons Attribution 4.0 International License 














2 


G.Spencer 


results we are aware of on maximizing influence in networks focus on a kind of “one-shot” influence 
(adoption decisions are irreversible). In this paper we focus on seeding interventions when nodes may 
repeatedly update their behavior (adoption decisions are reversible), as in most agent-based models. 

In network settings, very few agent-based models proposed by social scientists yield optimization prob¬ 
lems which have efficient accurate algorithms[|], so heuristic methods like genetic algorithms and local- 
improvement algorithms are a natural recourse. These heuristic methods may demand many function eval¬ 
uations, and may require significant replication to ameliorate issues associated with settling at local optima 
while missing global optima. For example, running a local-improvement algorithm from 1,000 random 
initial solutions may give an experimenter substantial peace of mind. Unfortunately, for increasingly- 
complex agent-based simulations, even evaluating the predicted effect of an intervention may be quite 
slow/computationally costly (and not simply as a matter of poor implementation). Some agent-based 
models may not even converge to a consistent ordering on the quality of interventions: how long should 
the agent behavior be allowed to evolve before the relative strength of two interventions is assessedip^ 
Optimizers and mathematicians interested in encouraging computationally-responsible reasoning in the 
social sciences (replication to reduce artifacts and the design of consistent evaluation metrics) should em¬ 
phasize the variable computational costs associated with this diligence. Despite forbidding anecdotes, in 
many cases, replication and convergence to a consistent ordering are reasonable demands. In this paper, 
we prove hardness-of-approximation results for spreading a {0, l}-behavior with reversible adoptions, 
but we also give quadratic bounds on function evaluation and describe near-convergence from an arbitrary 
pattern of initial adoption. 

Further, as new sources of data from large online-social networks become available (e.g. variable 
strength of ties/variable sharing across links), what combinations of these these model features can be 
incorporated without jeopardizing manageable convergence (and compute) times? Researchers may pre¬ 
emptively limit descriptive power for fear that repeatedly evaluating ever-more complex simulations will 
be prohibitively slow for instances of real-world scale. Yet some powerful descriptive model features may 
carry only mild computational costs. For our model, adding a variable-integer-edge-weight feature causes 
only a modest increase in our function-evaluation bound. Including data-driven model features may in¬ 
crease the specificity of simulations describing real systems that social scientists care about. We believe 
that there is a role for mathematical theory in helping scientists understand the computational costs of 
incorporating model features that capture new data sources. 


Our focus. Spreading-phenomena models of threshold-based binary-behavior in networks have a long 
history. Granovetter’s foundational work in the sociology literature in the late 70s motivated computa¬ 


tionally inquiry on the effect of variation in (often synthetic) network structure (see Granovetter (1978)). 


For example. Watts studied the distribution of long-term cascade sizes as a function of network structure 


under threshold-spread models (see Watts (2002)). More recently, these models have been studied exten- 
sivelyin_sociolog^_asJ|Com£le}^_Contagion7_For_exum£lej_seeJCentolu_and_^ac^_(^007|j_an^Centola 


et al. (|2007 ). Threshold-based behavior updates also arise in behavioral economics models, as in Nyborg 
et al. ( 2006 ). Empirical work on Twitter data has suggested that the Compl ex-Contagion model i s useful to 
understand spread of political activism in real-world contact networks (see Romero et al. ( 2011 )). Thresh¬ 
old decision rules also have provenance in game theory where they emerge as best-response strategies for 


Or even satisfying approximation-algoritlims which are efficient and provably near-optimal. 

This is particularly problematic as the choice of time-step interval is often somewhat artificial in agent-based models. 
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repeated game play in networks]^ 

We consider a model of binary {0, l}-behavior spreading in a network in which nodes update behavior 
based on a threshold of adoption among their neighbors. These threshold values are given explicitly in 
the input and may be heterogeneous. We assume that nodes will adopt a desirable behavior (Behavior 
1) only when they observe enough of their neighbors choosing Behavior 1. If too few neighbors adopt 
Behavior 1, a node will adopt Behavior 0. As in the game-theoretic setting, we allow all agents to update 
concurrently and repeatedly. 

If a planner can temporarily incentivize a small set of nodes to adopt Behavior 1, what does this 
threshold-based spread model suggest about form of highly-effective interventions? For a given budget 
there can exist a exponentially-large space of spatial interventions. Though the behavior-update process 
does not necessarily converge to a stable pattern, there is a som ewhat s atisfy ing notion of the long-term 
adoption caused by an intervention (this follows from a result of Goles ( 1985 )). 

We show that four natural variations of this intervention-planning problem are hard-to-approximate; 
no polynomial-time algorithms can exist without substantial compromise in the guaranteed quality of 
the solution. Can we still learn something about what these models predict about the form of effective 
interventions? 

Given a behavior-spread model, nearly the bare minimum required to explore optimized interventions is 
the ability to repeatedly evaluate the objective function. For example, methods like local search algorithms 
and genetic algorithms involve repeated computation of the objective function value. Are heuristics which 
require many function evaluations reasonable? For discrete-time reversible-threshold binary-behavior 
updating we show that evaluating the long-term effect of an intervention takes 0{\E\'^) operations where 
E is the edge set for the network. The constant hidden by the big O notation is relatively small (at 
most 8 or 12 for the most general cases), and for an edge-weighted variant (which can describe variable- 
strength ties in a social network) in which integer weights come from the set {1, 2,3,..., fc} at worst 
function evaluations cost 0(fc|i?p) operations, still with constant at most 8 (or 12). The foundation of 
these bounds on compute time are new time-to-convergence results we prove for threshold automata in 
networks: behavior updating is guaranteed to converge in 2|i?| -f |y| time steps (or 2k\E\ -f \V\ time steps 
for the edge-weighted case). 


(ill) Pq[. background on game play in networks, see the widely-read textbook of iiasley and Kleinberg 


(MIC), 
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2 Model: Discrete-Time Reversible-Threshold Spread 

Input: Finite network G = {V, E). 

Each node i €V has an integer threshold bi (the bi may be non-uniform). 

Initial State Vector: At each time t = {0,1,2,3,...}, each i G V has one of two behaviors: 

Behavior 0 or Behavior 1. The Adoption Vector , x{t) G {0, describes the pattern of adoption 

at time t. ^ An initial adoption vector, a;(0), is given. Note that non-zero. 

Planner Intervention at Seed Set: 

The planner specihes a d-time-step intervention at subset V' C V. 

For all i G V, set 2:^(0) = 1. 

Evolution of Adoption: 

At each time f > 1, each i G V updates behavior: 

• Decision Rule: If (i G V' AND t < d — 1), then set Xi{t) = 1. 

Otherwise, node i updates based on behavior of neighbors at t — 1 : 

If I^jeS(z) “ 1) ^ *en set Xi(t) = 1. 

If I^jeS(i) “ 1) < bi, then set x^{t) = 0. 

° If Xi{t) = 1 this indicates node i adopts behavior 1 at time t, etc. 

These inputs and decision rule specify a sequence of binary vectors a:(0), a:(l), a:(2),... that describes the 
evolution of adoption of behavior in G. 

Suppose that a planner can temporarily intervene to force a subset of nodes to adopt Behavior 1 for 
the d time steps {0,1, 2,..., d — 1} (at time d these nodes resume normal updating). As other nodes 
update according to the decision rule. Behavior 1 may start to spread in the network. As is common, we 
refer to the subset of nodes where the planner intervenes as the Seed Set. Subject to constraints on the 
size or cost of the Seed Set, our planner wants to choose the Seed Set which results in highest long-term 
adoption (long after the temporary intervention has ended). Qualitatively, the planner would like to choose 
a Sticky Intervention for which x{k) contains many Is for arbitrarily large k. An intuitive definition of the 
Long-term Rate of Adoption is 

lim '^Xi{k) 
k—^oo L ' ^ J 


Unfortunately, for the decision rule dehned, x{k) may not converge. When the limit above does not 
exist, x{k) eventually alternates between two vectors (from |Goi^ (1985))p^ , so dehne the Long-term 
Rate of Average Adoption as 


lim 

k—¥oo 


['^^[xi{k) + x,{k + 1)^ 
ieV ^ 


For an example of a repeating cycle of length 2, see the appendix. In a computational setting, a useful stopping condition for 
function evaluation should incorporate this possible settling at a 2-cycle behavior. 
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Our planner seeks to maximize this well-defined objective function. There are two natural notions of what 
it means to intervene temporarily. 

Duration of Intervention: 

• temporary (t); The planner chooses a Seed Set and intervenes continuously until growth of Behav¬ 
ior 1 stops|^ then stops intervening. 

• fixed-duration(fd): The planner chooses a Seed Set and intervenes for d consecutive time steps, 
then stops intervening. 

Comment (Adoption is reversible): In this model, nodes may update from Behavior 0 to Behavior 1, or 
from Behavior 1 to Behavior 0. In particular, updates from 1 to 0 may occur in cascades of Behavior 0 
that start at f = 1 based on an unstable pattern of adoption in the initial state vector, a:(0), or in cascades 
of Behavior 0 that start when the Seed Set V is no longer forced to Behavior 1. 

Planning Objective: 

• Problem 1: Min-cost Complete Conversion (MCC) What is the smallest cardinality Seed Set 
required to convert the entire network to Behavior 1 permanently? 

• Problem 2: Budgeted Maximum Conversion (BMC) If the Seed Set contains at most k nodes, 
what is the maximum number of nodes that can be permanently converted to Behavior 17 

3 Seeding the Stickiest Intervention is Hard to Approximate 

The following table summarizes our results on hardness of approximation for the four variants of the 
seeding problem defined. 


Summary of Results on Hardness of Approximation: 



temporary (t) 

fixed-duration (fd) 

Min-Cost 

Complete Conversion 

f?(ln(OPr))) 

f?(ln(OPT)) 

(provided that d > 2) 

Budgeted 

Maximum Conversion 

< 0.632 

< (1 _ i) « 0.632 
(provided that d> 2) 


For Min-cost Complete Conversion, we prove that no polynomial-time algorithm guarantees a 0(ln(0PT))- 
approximation (unless NP has slightly superpolynomial time algorithms). This is by reduction from 
the Set Cover problem, for which Feige proved a ln(n) threshold for efficient approximation (see Feige| 
( 1998| )). Notably, the lower bound we give here is a function of the size of the optimal Seed Set. For 
Budgeted Maximum Conversion, we prove that no polynomial-time algorithm can guarantee more than a 
(1 — i)-fraction of the optimal value. This is by reduction from the Maximum-Coverage Problem, where 
we again leverage a hardness result of Feige. 


Or, the growth of the 2-time-step average level of Behavior 1 stops. 
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Before proving these hardness results, we comment on directions for positive algorithmic results. Given 
our (1 — i) inapproxamability result for Budgeted Maximum Conversion, readers may wonder whether 
the (1 — |) -approximation algorithm for the suggestively-named “Linear-Threshold Model” of Kempe 
et al. ( 2005| ) suggests a useful direction for the problems dehned here. Unfortunately, the positive al¬ 
gorithmic result in Kempe, et al. relies on a very specialized model assumption about the form of un¬ 
certainty in threshold values. This assumption is used to prove that adoption is a submodular function 
of the set of seedsj^ Known results on maximizing submodular functions subject to budget constraints 
then immediately guarantee the success of certain greedy approaches to solution construction. In fact, 
submodularity-dependent analysis i nforms much of the theo retical computer science work on influence 
in networks (for example, also see Mossel and Roch ( 201C )). We emphasize that the objective in our 
problems is not submodular, and greedy approaches may fail dramatically for very simple examples. 

Consider the Set Cover Problem dehned as follows. Let S denote a set of elements {1, 2,..., n}. Let F 
denote a group of subsets of S which we will denote Ji, J2, ..., J\f\- Call a set of indices I a Set Cover if 


Uig/Ji = S. 


The goal is to hnd a Set Cover of minimum cardinality. 

Theorem 1 The Set Cover Problem can be reduced in polynomial time to an instance of Min-cost Com¬ 
plete Conversion (for any period of seeding d > 2). As a result, Min-cost Complete Conversion is 
U(ln(OPT)) hard to approximate. 


Proof: Given an arbitrary instance of the Set Cover Problem, construct an instance of Min-cost Complete 
Conversion as follows. For each element i G S, create a node Xi. Denote this set of “element nodes” 
by Vs- For each Ju G F create a node xj^. Denote this set of “subset nodes” by Vp- Our constructed 
instance has node set U = Us U Vp. For every (i, Jfc)-pair with i G S and Jk G F, if i G Jk then include 
edge (xi, xjf) in edge set E. Notice that (U, E) gives a bipartite graph. Let the threshold for each node 
Xi G Us be 1. Let the threshold for each node xj,, G Vp be the degree of xj^., which by construction is 

iJkl- 

We make two observations about this class of constructed instances. First, for any seed set in the 
constructed instance, all updates to Behavior 1 that will ever occur happen by the end of f = 2. To 
see this, consider Us and Vp separately. Suppose Xi G Vs is not a seed; Xi can only be converted to 
Behavior 1 if there exists some Jk G E with i G Jk that has XJ^, a seed. Otherwise, Xi is adjacent only 
to non-seed subset nodes; due to the construction of the thresholds for Vp, such non-seed subset nodes 
are only converted to Behavior 1 when every node corresponding to a contained element already has 
Behavior 1. Since Xi does not have Behavior 1, such a subset node will not be converted to Behavior 
1 (and consequently will never cause nodes corresponding to its elements to adopt Behavior 1). Thus, 
any non-seed Xi G Vp that adopts Behavior 1 will do so because it is adjacent to a seed node. Since the 
threshold for Xi G U 5 is 1, adoption of Behavior 1 must happen at f = 1. Since any subset node is only 
adjacent to element nodes, if the hnal non-trivial updates for element nodes occur at f = 1 , then the final 
non-trivial updates for a subset node must be at f = 2 . 

Second, suppose that a seed set contains a subset node xj^. If xj^. is forced to adopt Behavior 1 for 
at least t G {0,1} then must adopt Behavior 1 for all t. This follows from our construction of the 

Kempe et al. assume that each node i chooses a fractional threshold uniformly from [0,1]. Submodularity no longer holds if for 
arbitrary e > 0, each node i chooses a fractional threshold uniformly from [e, 1]. 
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Where F = {Jk,Ji 7 Jm-; ■■■7 
and S = {i, a, b, c,z}. 


For example, Jk = {i, a, c}, etc. 


Fig. 1: Schematic of constructed bipartite graph. The constructed instance of Min-cost Complete Con¬ 
version. Each node in Vs has threshold 1. Each node in Vf has threshold equal to the cardinality of the 
corresponding subset Jk- 


thresholds. Since xj^ has Behavior 1 at f = 0, at f = 1 all element nodes adjacent to xj^ will have 
Behavior 1. As a result, at f = 2, xj^. will freely choose Behavior 1 without being forced, and also, all 
element nodes adjacent to xj^. will freely choose to adopt Behavior 1 (since xj^ was forced to Behavior 
1 at f = 1). At later time steps updates are trivial: at t, xj,. observed all neighbors adopting Behavior 1 at 
t — 1, and all element nodes adjacent to xj^. observed xj^ adopting Behavior 1 at f — 1. Effectively, seeds 
which are subset nodes quickly “self-stabilize,” and do the same for their element-node neighbors. 

Now solve the constructed instance of Min-cost Complete Conversion where d > 2 (the Seed Set will 
at least be forced to Behavior 1 for t G {0,1}). Call the returned Seed Set Q. We explain how to massage 
Q to find a seed set that only contains subset nodes which has size at most \Q\ and still converts all of V 
to Behavior 1. This massaged Q will have a natural interpretation as a Set Cover for S of cardinality at 
most IQj. 

Suppose that the seed set Q contains Xi G Vs- If all neighbors of Xi are also seeds, then Xi can be 
removed from Q to obtain a strictly smaller seed set that converts all nodes by f = 2 (the only change is 
that Xi will now convert to Behavior 1 at f = 1 causing at most a 1 time step delay in other updates to 
Behavior 1). Otherwise there exists some xj, ^ Q with i G Ji- In this case, massage Q by removing 
Xi and adding such a xj^. Any subset node whose adoption of Behavior 1 depended on Xi still adopts 
Behavior 1 by f = 2 (since now Xi adopts Behavior 1 at f = 1, as i G Ji). Any other node updates altered 
by this substitution result in Behavior 1 at a node in the place of Behavior 0. Thus, the massaged version 
of Q still converts all of V to Behavior 1 by f = 2 (by construction of the node thresholds, full adoption 
is stable for t > 3). Repeat this removal/massaging procedure until Q contains only nodes corresponding 
to subsets from F. 

Now interpret the subsets corresponding to nodes of massaged Q as a proposed set cover I of cardinality 
at most IQ I- As verified above, the massaged seed set Q still converts all nodes of V to Behavior 1 by 
t = 2. Since non-seed element nodes in Vs are converted only by adjacency to subset nodes which are 
seeds (from our first observation), every element node in Vs must be adjacent to some seed from massaged 
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Q. By construction of edge set E, this means that alH S S' appear in some subset indexed by I. Thus, / 
is a set cover containing at most \Q\ sets from F. 

Finally, the optimal value for the Set Cover problem cannot be strictly less than the Min-Cost Complete 
Conversion optimal value, since every set cover L with = S corresponds to a seed set in our 

constructed instance of Min-Cost Complete Conversion which converts all of V to Behavior 1 (all element 
nodes are converted to Behavior 1 at f = 1 due to the covering property, and consequently all non-seed 
subset nodes are converted at f = 2). 

Due to the correspondence demonstrated between solutions for an arbitrary Set Cover instance and 
solutions of the same numerical value for our polynomially-constructed instance of Min-cost Complete 
Conversion (for arbitrary d > 2), an a-approximation algorithm for Min-cost Complete Conversion im¬ 
mediately gives an a-approximation algorithm for Set Cover. Thus Min-cost Complete Conversion in¬ 
herits f2(lnn)-hardness from the Set Cover Problem where n = IS”! (this hardness holds unless NP has 
slightly superpolynomial time algorithms, see Feige (1998) for details). Notice that [S'! corresponds to 
I Vs I in our constructed Min-Cost Complete Conversion instance: our massaging procedure shows that 
OPT < |Vs|, so r2(ln(|Vs|))-hardness of approximation certainly implies fl ln(OPr))-hardness of ap¬ 
proximation for Min-Cost Complete Conversion. □ 


A similar reduction (which introduces dummy nodes to stabilize element nodes and creates a complete- 
subgraph gadget on Vp to avoid permanent conversion of subset nodes) shows that the Budgeted Maximum- 
Coverage Problem can be reduced in polynomial time to our Budgeted Maximum-Conversion Problem. 
Again, Budgeted Maximum Conversion inherits a hardness due to Feige (1998): unless P = NP there is 
no approximation algorithm that can guarantee a solution strictly better than a (1 — i) fraction of optimal. 


Theorem 2 The Budgeted Maximum Coverage Problem can be reduced in polynomial time to an instance 
of Budgeted Maximum Conversion (for any intervention length d > 2). As a result. Budgeted Maximum 
Conversion is {1 — j)-hard to approximate. 

The details of this proof appear in the appendix. 


4 Computing the Effect of Intervention: 

Long-Term Average Adoption Rate 

Given our hardness results in the previous section, we turn our attention to the feasibility of optimization 
heuristics. Almost the bare minimum required by an optimization heuristic is the ability to repeatedly 
evaluate the objective function associated with a candidate feasible solution. In our case, what is the long¬ 
term effect of a particular seed set? If function evaluations are too-expensive computationally, it may limit 
the size of instances where researchers can reasonably conduct computational studies. For Discrete-time 
Reversible-threshold binary behaviors we give an upperbound on the number of operations required to 
compute the Long-Term Average Adoption Rate which is quadratic in the cardinality of the edge set: 

Theorem 3 The long-term average adoption rate can be computed in 0(|£’p) operations. 

Theorem ^ follows immediately from new bounds we give in Section [4-.l| on the number of time steps 
required before guaranteed convergence to the long-term average adoption rate. In each time step, the 
number of operations required to update all nodes according to adoption in their neighborhoods is 0(|£'|), 
as each node x must sum over |i5(a:)| terms (for a total of 2\E\ terms over all nodes). 
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The bound in Theorem || is for the more complicated cases: fixed-duration intervention, and tempo¬ 
rary intervention from an arbitrary initial state vector x(0) € {0, For a temporary intervention 

maintained until growth of Behavior 1 stops starting from initial state with | |a:(0) 11 = 0, convergence in 
0(|y I) time steps to a stable pattern of long-term adoption is obvious: the evolution of adoption may be 
cleanly divided into a first phase is which adoption of Behavior 1 is growing (during the intervention), and 
a second phase in which adoption of Behavior 1 is being eroded (after the intervention). Each phase lasts 
at most \V\ time steps: if a time step elapses with only trivial behavior updates (each node maintains their 
behavior from the previous time step) then no non-trivial updates can possibly occur in future time steps. 


In the the more complicated cases, adoption of Behavior 1 and erosion of Behavior 1 can occu r simul¬ 
taneously in G. Bounding time-until-convergence is significantly more subtle for these cases. Further, 


in these cases the long-term rate of adoption may not be well defined. Instead, a simple long-term average 
rate may be reliably computed after convergence to a cycle that oscillates between 2 adoption vectors. We 
summarize our results in the table below. 


Summary of Results on Stability and Convergence to Long-term Average Adoption: 



Discrete-Time Reversible- 
Threshold Seeding (DRSeed) 

Weighted-Neighbor 

Variant of DRseed 

temporary (t) 

Given ||a:(0)|| = 0 

converges to: stable adoption vector 
convergence time bound: 2\V\ 

converges to: stable adoption vector 
convergence time bound: 2|V| 

temporary (t) 

From arbitrary a;(0) 

converges to: 2-cycle (from Goles (1985)) 
convergence time bound: 2(2|ii)| -f |I^|) 

converges to: 2-cycle (from poles (1985)) 
convergence time bound: 2{2k\E\ -f |V|) 
(for edge-weights from {0,1, 2,..., fc}) 

fixed-duration (fd) 

From arbitrary a:(0) 

converges to: 2-cycle (from poles (|1985)) 
convergence time bound: d-\-2\E\ -|- \V\ 

converges to: 2-cycle (from poles| (|1985 )) 
convergence time bound: d -f 2k\E\ + \ V\ 
(for edge-weights from {0,1, 2,..., fc}) 


Before the proofs for the fixed-duration cases we make the following comment. How much could our 
convergence bounds be improved? Though the adoption vector evolves in an exponentially-large space, 
for temporary interventions our bound on convergence time is linear in the number of nodes. In the more 
general fixed-duration setting we give an upperbound on convergence time that is linear in the number of 
edges in the network. Considering a network which is a simple line graph shows that the best possible 
upper bound on adoption-vector convergence time is \V\. We note that the average degree in social 
networks is often bounded by a constant : for such networks all upper bounds given are linear in the 


number of individuals in the network, so that at most a constant-factor improvement could be given. 

In the next section, we harness special properties of our model to tighten a convergence argument given 
in |Goi^ ( [l9^ for a more general class of threshold automata. In particular, our bound is linear in the 
size of the edge set of the network regardless of degree distribution (whereas Goles (1985) stated looser 
bounds and only claimed linear convergence time for uniform-degree networks). Our proof extends almost 
immediately to the case in which edges have weights from the set {0, 1,2, ...,k} for constant k. 


(«i) jvjQtg (ijat when | |x(0) 11 = 0, the temporary variant corresponds to a special case of the fixe d -durat ion variant when d > n. 
(viii) often understood as a consequence of Dunhar’s social brain hypothesis (see Dunhai (1992l). 
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4.1 Convergence from an Arbitrary Pattern of Initial Adoption 

For adoption vector x{t) G {0, Ill'll, let Nx(t) denote the set of indices i for which Xi{t) = 1. The 
following lemma describes a basic property of our economic-threshold decision rule (which is not true of 
the more general class Goles studied). We establish it now for use near the end of our convergence-bound 
proof. 

Lemma 1 (Monotonicity of the future adoption set in the current adoption set) Let x(f) and x'{t) denote 
two adoption vectors at time t. Let x(T) denote the result of applying the update rule to x(t) for T — t 
steps, andlet x'(T) denote the result of applying the update rule to x'(t)forT— t steps. If C 
then for all T > t, C N^nj-y 


Proof: (of Lemma ^ By induction, starting at time t. From the assumptions of the theorem, C 

N^i(y^y Induction hypothesis: suppose that for time T — 1 we have ^ ^x'{t-i)- Consider time 

step T. For each node i & V: \f i n N^(ypy then it must be the case that at least bi neighbors of i are 
in N^(j-_iy Since all nodes which are in in (by the induction hypothesis), at 

least bi neighbors of i are in N^i(^j^_iy By the update rule for i, we get that x'i(T) = 1, so that i G N^iyj-y 
Thus, W‘a;(T) C N^/yj-y □ 


Theorem 4 (Main Theorem: Convergence Bound for Reversible Economic-Threshold Spread) 

In graph G = (V, E), given an arbitrary initial adoption vector x(0) G {0, Ill'll' 

within 2\E\ + \ V\ time steps the evolving adoption vector will converge to a cycle of length at most 2. 


Next, we explain the proof of this main theorem, which requires a number of intermediate lemmas. 
The d e cision rule of our model is a special case of the general threshold-based update rule analyzed in 
Goles| ( 1985 ). We follow Goles analysis closely, but by exploiting the monotonicity of our restricted case 
(established in Lemma |^, and emphasizing a more combinatorial description of a key function, we give 
significantly tighter results for our model. 

Proceeding forward, we use the fact (from Goles (1985)) that for a specified set of integer thresholds 
denoted bi for i G V (our model as described until now), replacing bi with bi — 0.5 for alH G L gives 
an update procedure indistinguishable from updating that uses the original bi. 0 Thus, without loss of 
generality, we assume all thresholds are half-integer. 

The following lemma is directly from Goles (1985), but we include a proof for completeness. 

Lemma 2 (Adoption enters a cycle, |Go/ej| (1985)) Given any x{0) G {0, Ill'll there exists an integer 
c with the property that x(f) = x{t -f c) = x(t + 2c) = ... and that x(t) is not equal to any of x(t -f 
1), x(t + 2),..., x(t -f c — l)/or all t above some Transient Time T. 


Proof: (of Lemma ^ The space of possible adoption vectors, {0, Ill'll, is finite. Thus, there exists a 
time step at which some adoption vector y G {0, Ill'll occurs for the second time. Since the update 
process is deterministic, the behavior from that point forward will be identical to the evolution after the 
first occurrence of y, giving a cycle. □ 


We’ll define and analyze a special function E(x{f)). In contrast to the analysis in Goles ( 1985 ), we de¬ 
scribe E(x{f)) in terms of the update process in the network. To do this we introduce the idea of sightings. 


E.g. replacing a threshold of 3 at node i with a threshold of 2.5 changes nothing about how the adoption vector will change. 
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Given an adoption vector y e {0, Ill'll, we say that node i sights each of its neighbors which is 1 accord¬ 
ing to yj^ The expression for E{x{t)) will include two terms related to sightings. 

Sightings Necessary to get x{t): 

If x{t) occurs during our updating process, we can give a bound on how many sightings must have 
happened in the graph at time f — 1; if i is 1 at f, it must have sighted at least bi neighbors which were 1. 
Thus, throughout the graph at least 

iv^l 

biXi{t) = {b, x{t)) = (Sightings Necessary to get x{t)) 

i=l 

sightings must have happened at f — 1. This counts accurately: bi is included exactly when Xi{t) is 1. 

Sightings wasted in tnrning on a: (f -I-1): 

The adoption vector x{t) produces the next state x{t -\- 1): each node i that is 1 in xit -I- 1) saw at least bi 
sightings in x{t), but i may also have sighted some extra neighbors at 1 beyond the bi that were required. 
We say these sightings were wasted in turning on x{t -f 1). Let Ai denote the *th row of the adjacency 
matrix for G.[^ The number of sightings that were wasted at node i is Aix{t) — bi. 

So, summing over i: 


Wl 


'^{Aix{t) - bi){xi{t + 1)) = {{Ax(t) - b), xit + 1)) (1) 

i=l 

= (Sightings wasted in turning on x{t -\- 1)) (2) 

This counts the correct quantity because Aix{t) — bi is included in the sum exactly when Xi{t + 1) = 1. 

Defining E{x{t)): 

E{xit)) : = (Sightings Necessary to get x{t)) — (Sightings wasted in turning on xit -\- 1)) (3) 

= {b,xit)) - iiAxit)-h),xit+l)) (4) 


Lemma i Uotes ( 1985 ) If the adoption vector has not entered a 2-cycle (aka, assuming xit) ^ xit-\-2)), 
then Eixit)) is decreasing: 

Eixit -\- 1)) -I- 0.5 < Eixit)). 


Proof: (of Lemma ^ The proof of this lemma is largely from Goles ( 1985 ): we include the details for 
completeness and because they are critical to explaining the subsequent improvements we make. 


In terms of our update process: are the number of sightings i makes at x{t) greater than fe;? If so, then i is I at time f + 1. 

The adjacency matrix of G is the \ V\ X \ V\ matrix that has Aij = 1 exactly when the edge {i,j) is in G and has all other entries 


0 . 
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We will show E{x{t)) — E{x{t + 1)) > 0.5. 

E{x{t)) - E{x{t + 1)) = (5) 

= { 6 , x{t)) — {{Axit) — b), x{t + 1 )) - {b, x{t + 1 )) + {{Ax{t + 1) - b), x{t + 2 )) ( 6 ) 

= {b, x{t)) - {{Ax{t)),x{t + 1 )) + {{Ax{t + 1) - b), x(t + 2 )) (7) 

= ( 6 , x{t)) - {{Ax{t + 1)), x{t)) + {{Ax{t + 1) - b), x{t + 2 )) ( 8 ) 

= —{{Ax(t + 1) - b),x{t)) + {{Ax{t + 1) - b),x{t + 2 )) (9) 

= {x{t + 2 ) — x{t), {Ax(t + 1) — b)) (10) 


In the algebra above: first we make a combined term for all sightings at t by nodes which are 1 at f + 1, 
next we switch the order of summation: sightings by Is from f + 1 at x{t) are the same as sightings by Is 
from t at x{t + 1) (which is alternately true from A a symmetric matrix), then simply combine terms. 
The righthand side of the equality sums the following term over all nodes i: 

[xi{t + 2) - x^{t)]{Aix{t + 1 ) - bi) ( 11 ) 

If i has the same state in x{t + 2) and x{t) then the term for i has value 0. 

If Xi{t + 2) = 1 and Xi{t) = 0: Xi(t + 2) = 1 means Aix(t + 1) — > 0.5 

If Xi{t + 2) = 0 and Xi{t) = 1: Xi(f + 2) = 0 means Aix{t + 1) — bi < —0.5 

These facts follow from the half-integrality of the bi. In both cases where Xi{t + 2) ^ Xi{t), the term 
for node i contributes at least 1/2 to the value of E{x{t)) — E{x{t + 1)). Since we assumed we are not 
in a 2-cycle yet (aka that x{t) ^ x{t + 2)) there must be at least one i that contributes value 1/2. This 
concludes the proof of Lemma ^ □ 

We can now prove that the evolving adoption vector will converge to a cycle of length at most 2. 

Proof: (State of “Convergence” claimed in Theorem Suppose the cycle guaranteed by Lemma ||has 
length c > 2 : x{t) = x{t + c) and x{t) is not equal to any of x{t + 1), a:(f -|- 2),..., x{t -I- c — 1). We can 
apply Lemma || 

E{x(t)) > E(x{t + 1)) > E{x{t + 2)) > ... > E{x(t + c)). (12) 

Since x{t) = x{t + c), we also have that E{x(t)) = E(x{t + c)). This gives a contradiction. Thus c < 2.0 


It remains to prove how long this convergence will take. Following Goles, we showed in Lemma || that 
E(x{t)) decreases in every time step unless the process has converged to its final 2-cycle, so bounding the 
range of E(x{t)) will give an upper bound on the transient time of the process. This is still our general 
strategy, but to give an improved upper bound over that from 3oles| ( |1985| ), we use the monotonicity of 
our process (Lemma to show that unless the process is already very close to a 2-cycle, the decrease in 
E(x{t)) is at least twice as large per time step as specified by Lemma ||: 


Lemma 4 Unless the evolving adoption vector is within 2|I^| time-steps of entering a 2-cycle, at least 2 
nodes i have xft) x ft -\- 2), so that 


E{x{t + 1)) + 1 < E{x(t)). 
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Proof: (of Lemma ^ Suppose that x(t + 2) differs from x{t) in only one position. We’ll show that the 
adoption vector enters a 2-cycle within 2|y I time steps. denote the subset off^ corresponding to 

nodes that are at 1 in x(t). Suppose that Xi{t-\-2) = 1 and Xi{t) = 0, and for all other j, Xj(^t-\-2) = Xj{f). 
By definition, C We use monotonicity (Lemma to reason about in the following 

time steps. 

Consider it results from sightings at Since C A^a;(t_|_2), all sightings at 

happen at Thus, is a superset of N^(j.j^iy any node that decided to adopt based on x{t) 

will certainly adopt based on x{t -\- 2). By the same rationale, N^y-i-i) ^ Nx{t-i- 3 ) gives that Nx(j^j^ 2 ) ^ 
Nx(t+i)- Then Nxy+ 2 ) Q N^y+i) gives that N^y+s) Nxy+ 5 ), etc. This argument holds iteratively. 
To make this more clear, write the set of adopters in a z-pattern using N^y) ^ Nxy+i) to denote that the 
set of adopters at time f -f 1 results from the set of adopters at time t, and add the subset relationships: 


Nx{t) 


^x{t+l) 

ni 


ni 

Wx(t+2) 

=> 

-^a;(t+3) 

ni 


ni 

b^x{t + 4:) 

=> 

^x{t-\-5) 

ni 


ni 

-^x(t+6) 


-^a;(t+7) 


Since the update process is deterministic, if any of these C relationships is not proper then the adop¬ 
tion vector has entered a 2-cycle. That is, if lVa;(t-i-fc) = Nx(t+k+ 2 )^ then it also must be the case 
that Nx(t+k+i) — ^x(t+k+ 3 )^ so that for all time steps after t -\- k the adoption vector alternates be¬ 
tween x(t -\- k) and x(t -f fc -f 1). The maximum number of time steps that could elapse before a 
C relationship is forced to be non-proper is 2|L|: since all the subsets must be proper, at least one 
node is in {Nx(t-vk-i- 2 ) \ ^x(t-i-k)) for all k. The longest path that could ever be achieved is if N^y) is 
the empty set and Nx(t-v 2 \v\) is the entire set V. After this many time steps it must be the case that 
= Nx;(t+ky the adoption vector has entered a 2 -cycle within 2 |L| time steps. 

A symmetric argument (with opposite direction of C relationships) establishes the case where Xi (t -f 
2) = 0 and Xi{t) = 1, and for all other j, Xj{t -\- 2) = Xj{t). We have established that if x{t -\- 2) 
differs from x{f) in only one position the adoption vector will enter a 2 -cycle within 2 |L| time steps. 
Equivalently, if the adoption vector is more than 2|L| time steps from entering a 2-cycle, then it must be 
that x{t -\- 2) differs from x{t) in strictly more than 1 position. Thus, evaluating the final expression from 
the proof of Lemma ||: 

E{x{t)) — E{x{t 1)) = {x{t -f 2) — x{t), (Ax{t -f 1) — b)) > 1. (13) 

This concludes the proof of Lemma ^ □ 
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Lemma 5 The range of values that E{x(t)) can achieve is < 2\E\. 

Proof: (of Lemma 5) Recall the definition of E(x{f)): 

E{x{t)) = (Sightings Necessary to get x{t)) — (Sightings wasted in turning on x(t + 1)) (14) 

From our definitions of necessary and wasted sightings, the first term is always positive (or 0) and the 
second term is always negative (or 0). Letting the first term be as large as possible, and the second term 
be as small as possible, we obtain an upper bound on E{x{f)) of (every node in \V\ makes the 

necessary sightings for it to adopt). 

An obvious lower bound on E{x{t)) assumes the first term is 0 and makes the second term as large in 
magnitude as possible (as many wasted sightings as possible at every node): 

|y| |v| |y| 

E{x(^t)) > -^(deg(i) - 6 j) = -^(deg(i)) (15) 

i—1 i—1 i—1 

Thus the range of E(x{t)) is at most the upper bound minus the lower bound: 

|y| |y| |y| |y| 

H = 2\E\. (16) 

i—1 i—1 i—1 i—1 

This concludes the proof of Lemma ^ □ 

Proof: (Time to Convergence claimed in Theorem ^ The range of E{x(t)) is at most 2\E\ from Lemma 
H At most 2\V\ time steps can decrease E{x(t)) by only 1/2. Every other time step must result in 
a decrease in E{-) of size at least 1. Thus, from any initial adoption vector a;(0) there are at most 
(2|L| + (2|i?| — |L|)/1) = 2|i?| + \V\ time steps before the adoption vector enters a stable state or 
a 2 -cycle. □ 


Extension: When Some Relationships Are More Influential. To our model input, add that each edge 
e € E has a weight We from the set of integers {0,1, 2,..., k}, and modify the decision rule as follows. 

• Decision Rule: For each node / G L, if the edge-weighted sum of neighbors of i who adopt 
Behavior 1 at / — 1 is at least bi, then node i adopts Behavior 1 at time t. Otherwise node i adopts 
Behavior 0 at time t. 

Our proof generalizes immediately. The matrix A is now the weighted adjacency matrix. The definition of 
E{x{f)) is generalized so the terms describe the weighted amount of sightings to get x(t) and the wasted 
amount of weighted sightings to get x{t + 1). Sums of degrees become sums of weighted degrees. The 
range of E{x{t)) is now 2 — ‘^k\E\. 


Theorems (Convergence Bound: Weighted-Neighbor Reversible-Threshold Updating). 

In graph G = {V,E), where each edge has weight Wg from {0,1,2, given an arbitrary initial 

adoption vector a;(0) G {0, Ill'll.- within (2 I^D — {‘^k\E\ |L|) time steps the evolving 

adoption vector will converge to a cycle of length at most 2. 
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Motivated by the prevalence of repeated behavior updating in the computational social sciences litera¬ 
ture, we considered the planning problem of designing Sticky Seeding Interventions in Networks under 
reversible-threshold discrete-time updating. We proved that several natural variants of the planning prob¬ 
lem are hard-to-approximate (by reductions from Set Cover), but we also provided quadratic bounds on 
function evaluation (even from arbitrary initial states of adoption). Thus, optimization heuristics that 
repeatedly evaluate the objective may be practical, even when empirically-motivated features like het¬ 
erogeneous edge weights are added to the model. Incorporating model features that represent new data 
sources (like variable edge-weights) increases the specificity of predictions, potentially allowing valuable 
contrasts in qualitative properties of optimized interventions across different networks. 

Our long-term spread objective that considers reversible adoption and heterogeneous edge weights is 
already complicated to evaluate, even though the model we consider in this paper is entirely deterministic. 
A prominent alternative to the game-theory-style concurrent-updating we’ve assumed is random asyn¬ 
chronous updating where nodes update in a random order one at a time.^ It is easy to construct small 
examples where random asynchronous updating leads to strongly-different long-term behavior than con¬ 
current updating (even in an expected-value sense), or where the randomly-realized long-term outcomes of 
asynchronous updating vary enormously. It is unclear whether these types of differences might somehow 
be damped at a larger scale. Efficient exact expected long-term spread evaluation in this asynchronous 
context seems impossible due to the exponentially-large space of possible update orders, though perhaps a 
strong assumption about the structure of the input network (as frequently appears in the statistical physics 
literature) might provide some traction. 

We close with a general comment. The study of spread in networks gives rise to a number of fascinat¬ 
ing theoretical and applied questions. Variations in model assumptions and network structure can cause 
dramatic changes in the qualitative behavior of the system and in the form of optimized interventions. 
Many positive theoretical results rely on rather-special assumptions (for example, very-specific structural 
restrictions, or highly-special forms of uncertainty in the input instance). A significant future challenge 
will be to understand which messages about networks are “stable” against some variation in these assump¬ 
tions. While it may seem that theoretical reasoning, synthetic computational exploration, and scientific 
investigation of real networks are diverging fields, we believe that each of these areas has rich insights to 
offer to its counterparts. 
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Appendix 

Example of 2-cycle 

As mentioned in Section ^ the adoption vector may not converge to a single repeating vector. 


1 • 


1 •--• 1 

2 


1 • 

Fig. 2: (An alternating cycle of length 2 with average adoption 50%) Nodes are marked with their thresholds. 
Start from an initial pattern of adoption in which only the center node has Behavior 1. The adoption pattern cycles 
between two vectors; average adoption is 50%. 


Figure 2 demonstrates an example in which iterated application of the decision rule does not converge 
to a single repeating adoption vector. In the example in Figure 2, an average rate of long term adoption 
can be computed because the adoption vector enters a small cycle (of length 2). 

The long-term behavior of the update rule is always at least as stable as the example in Figure 2: after 
a sufficient number of time steps the adoption vector will either be a stable repeating vector or alternate 


between 2 adoption vectors.We show in the main text that convergence to such an alternating state 


happens within 0(|i?|)-time steps (where the hidden constant is small). 


Hardness of Budgeted Maximum Conversion 

Here we give the details of the proof of Theorem ^ from Section || 

Consider the Budgeted Maximum Coverage Problem defined as follows. Let S denote a set of elements 
{1, 2,..., n}. Let F denote a group of subsets of S which we will denote Ji, J 2 , ■■■, J\f\- Given a budget k, 
the objective is to specify a set of k indices I so that the cardinality of the following union is maximized: 


Ci(zj Ji- 


(17) 


(xiii) gygjj j-Qj. more general class of threshold automata Goles studied. 
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Theorem 2 The Budgeted Maximum Coverage Problem can be reduced in polynomial time to an instance 
of Budgeted Maximum Conversion (for any intervention lasting at least 2 time steps). Ai a result, Budgeted 
Maximum Conversion is (1 — ^)-hard to approximate. 

Proof: (of Theorem^ Given an arbitrary instance of the Budgeted Maximum Coverage Problem, con¬ 
struct an instance of Budgeted Maximum Conversion as follows. For each element i G S, create a node 
Xi. Denote this set of “element nodes” by Vs. For each node Xi € Vs create a unique dummy node yi. 
Denote this set of dummy nodes by Vq. For each Jk G F create a node xj^. Denote this set of “subset 
nodes” by Vp. Our constructed instance has node set V = Vs U Vd C Vp. For every (i, Jfc)-pair with 
i G S and Jk G F: if i G Jk then include edge (xi, xj^) in edge set E. For each i G S, include the edge 
{xi, Vi) in E. Finally, for each pair of subset nodes in Vp, include the edge between them in E. Let the 
threshold for each node 0 :^ € Vs be 1. Let the threshold for each node € Vd be 1. Let the threshold 
for each node XJ^, G Vf be the degree of xj^, which by construction is \ Jk\ + 1^1 — 1. Let the budget 
for seeding be k nodes (assume fc < |F| — 1 as otherwise the best Maximum-Coverage index set could be 
found by enumeration in polynomial time). 

We make two observations about this class of constructed instances. First, no subset node will adopt 
Behavior 1 except in time steps in which it is forced to do so. This follows from our construction of the 
high thresholds for xj^.: the maximum number of neighbors of xj^ adopting Behavior 1 is the number of 
seeds (fc) plus the number of elements in Jk, for a total of \ Jk\ + k < | J^j -|- jfl — 1 (this is insufficient 
for xjj, to freely choose Behavior 1). Thus, even if all element nodes adopt Behavior 1, among the subset 
nodes only Seed nodes will adopt Behavior 1 (and only while the intervention is in place). 

Second, suppose that a Seed Set contains a subset node xj^,. If XJ^, is forced to adopt Behavior 1 for at 
least t G {0,1} then all of the element nodes adjacent to xj^ must adopt Behavior 1 for all t. This follows 
from our construction of the thresholds. Since xj^. has Behavior 1 at f = 0, at f = 1 all element nodes 
adjacent to XJ^, will choose Behavior 1. At f = 2 each such element node observes that XJ^, had (forced) 
Behavior 1 at < = 1 and so continues to adopt Behavior 1. Also at f = 2, the dummy node connected to 
each such element node will choose to adopt Behavior 1. Then, at f = 3, regardless of adoption by xj^, 
each element node and its dummy will observe each other adopting Behavior 1 at f = 2 and continue to 
adopt Behavior 1. All future time steps have the same trivial updates. Effectively, seeds which are subset 
nodes quickly cause their corresponding (element node, dummy node)-pairs to “stabilize” at Behavior 1. 

Now solve the constructed instance of Budgeted Maximum Conversion where d> 2 (the Seed Set will 
at least be forced to Behavior 1 for t G {0,1}). Call the returned Seed Set Q. From the budget constraint 
for seeding we have |Q| < k. We will massage Q to find a seed set that only contains subset nodes which 
has size at most |Q| and still converts as many nodes to Behavior 1 as Q does. This massaged Q will have 
a natural interpretation as a feasible solution for the Budgeted Maximum Coverage Problem for S. 

For i G S, suppose that either an element node Xi or a dummy node yi is in Q. If all Jk G E that 
have i G Jfc are in Q, then Xi (or yi respectively) can be removed from Q to obtain a strictly smaller Seed 
Set that converts to Behavior 1 all nodes that Q does. Otherwise, there exists some Ji G F with i G Ji 
and xjj ^ Q. In this case, massage Q by removing Xi (or respectively) and adding subset node Xj^. 
The resulting set of seeds still converts both Xi and y^ to Behavior 1 (since i G Ji this happens by f = 2 
even when d is as low as 2 by our second observation), and all other differences in node updates substitute 
Behavior 1 in the place of Behavior 0 (as now all element nodes-and their dummy copies- adjacent to 
xj, will permanently adopt Behavior 1 from our second observation). Repeat this removaPmassaging 
procedure until Q contains only nodes corresponding to subsets from F. 
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Now interpret the subsets corresponding to nodes of massaged Q as a proposed budgeted cover, /, 
of cardinality at most \Q\ < k. As veribed above, the massaged seed set Q still converts all nodes to 
Behavior 1 that Q did (and does so by f = 2). The number of elements from S covered by I is precisely 
half the number of nodes converted in our constructed Budgeted Maximum Conversion instance. 

Finally, the optimal value for Budgeted Maximum Coverage problem cannot be strictly more than 
half of the Budgeted Maximum Conversion optimal value, since every budgeted cover L whose union has 
I dig L Ji I elements corresponds to a seed set in our constructed instance of Min-Cost Complete Conversion 
which converts exactly 2| Uigi Ji\ nodes to Behavior 1 (all element nodes corresponding to elements in 
UigL Ji are converted to Behavior 1 at f = 1, and their dummy partners are converted to Behavior 1 at 
t = 2, and these nodes adopt Behavior 1 in all future time steps from our second observation). 

Due to the correspondence demonstrated between solutions for an arbitrary Budgeted Maximum Cover¬ 
age instance and solutions of exactly twice the numerical value for our polynomially-constructed instance 
of Maximum Budgeted Conversion (for arbitrary d > 2), an a-approximation algorithm for Maximum 
Budgeted Conversion immediately gives an a-approximation algorithm for Budgeted Maximum Cover¬ 
age. Thus budgeted maximum conversion (for arbitrary d > 2) inherits (1 — l/ e)-hardness f rom the 
Budgeted Maximum Coverage Problem (this hardness holds unless P = NP, see Feige ( 1998 ) for de¬ 
tails). □ 


A simpler reduction without the complete subgraph on Vp is possible if the threshold for a subset node 
can exceed the degree of the node. This seems somewhat abusive of the model, however, so we have 
included the argument as given. 





